The University of Amsterdam at WebCLEF 2007: Using Centrality to Rank Web Snippets

Authors

Valentin Jijkoun

Maarten de Rijke

Abstract

We describe our participation in the WebCLEF 2007 task, which targets snippet retrieval from web data. Our system ranks snippets by a simple similarity-based centrality, inspired by web page ranking algorithms. We experimented with the retrieval units (sentences and paragraphs) and with the similarity functions used for the centrality computation (word overlap and cosine similarity). Paragraphs combined with cosine similarity performed best, with precision around 20% and recall around 25% according to human assessments of the first 7,000 bytes of the responses for individual topics.
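The abstract does not give the exact centrality formula, so the following is only a minimal sketch under stated assumptions: centrality is taken to be the sum of a snippet's similarities to all other snippets, word overlap is normalized Jaccard-style, and cosine similarity is computed over raw term-frequency vectors. All function names (`tokenize`, `word_overlap`, `cosine_similarity`, `rank_by_centrality`) are hypothetical and do not come from the paper.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a snippet and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def word_overlap(a, b):
    """Shared words divided by all distinct words (Jaccard; an assumption)."""
    set_a, set_b = set(a), set(b)
    if not set_a or not set_b:
        return 0.0
    return len(set_a & set_b) / len(set_a | set_b)


def cosine_similarity(a, b):
    """Cosine between raw term-frequency vectors of two token lists."""
    tf_a, tf_b = Counter(a), Counter(b)
    dot = sum(tf_a[t] * tf_b[t] for t in tf_a.keys() & tf_b.keys())
    norm_a = math.sqrt(sum(v * v for v in tf_a.values()))
    norm_b = math.sqrt(sum(v * v for v in tf_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_by_centrality(snippets, similarity=cosine_similarity):
    """Sort snippets by their summed similarity to every other snippet."""
    tokens = [tokenize(s) for s in snippets]
    scores = [
        sum(similarity(t_i, t_j) for j, t_j in enumerate(tokens) if j != i)
        for i, t_i in enumerate(tokens)
    ]
    order = sorted(range(len(snippets)), key=lambda i: scores[i], reverse=True)
    return [(snippets[i], scores[i]) for i in order]


# Illustrative input: the retrieval unit (sentence or paragraph) is whatever
# the caller passes in as one string per snippet.
snippets = [
    "the cat sat on the mat",
    "a cat sat on a mat",
    "stock prices fell sharply today",
]
ranked = rank_by_centrality(snippets)
ranked_overlap = rank_by_centrality(snippets, similarity=word_overlap)
```

In this sketch a snippet that shares vocabulary with many others rises to the top, while an isolated snippet (the third one here) ends up last. Swapping the `similarity` argument corresponds to the word overlap versus cosine similarity comparison mentioned in the abstract.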

Similar resources

Using Centrality to Rank Web Snippets

REINA at WebCLEF 2007. Selecting Useful Snippets

This year's task consists of retrieving snippets, or pieces of text, from web documents about several topics. The extraction of such snippets can be approached in several ways, as can the selection of the most useful ones. We describe the segmentation process adopted and the selection of snippets carried out.

Overview of WebCLEF 2008 (Draft)

We describe the WebCLEF 2008 task. Similarly to the 2007 edition of WebCLEF, the 2008 edition implements a multilingual “information synthesis” task, where, for a given topic, participating systems have to extract important snippets from web pages. We detail the task and the assessment procedure. At the time of writing evaluation results are not available yet.

Overview of WebCLEF 2008

We describe the WebCLEF 2008 task. Similarly to the 2007 edition of WebCLEF, the 2008 edition implements a multilingual “information synthesis” task, where, for a given topic, participating systems have to extract important snippets from web pages. We detail the task, the assessment procedure, the evaluation measures and results.

Segmentation of Web Documents and Retrieval of Useful Passages

This year’s WebCLEF task was to retrieve snippets and pieces from documents on various topics. The extraction and the choice of the most widely used snippets can be carried out using various methods. This article illustrates the segmentation process and the choice of snippets produced in this process. It also describes the tests carried out and their results.

Publication date: 2007
